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Quality control method for manufacturing biopolymer arrays^ 



Description 

The invention relates to a quality control method for manufacturing 
biopolymer arrays comprising the use of detectable protecting groups. 

The synthesis of nucleic acids and peptides on a solid phase has been an 
established process for the last 20 years. The most prevalent method of 
nucleic acid synthesis is the phosphoramidite method of Beaucage and 
Caruthers, Tetrahedron Lett. 22 (1981), 1859-1862, where the 
oligonucleotide chain is built up by the repetitive condensation of individual 
nucleotide building blocks in the 3' or 5' direction. A variety of orthogonal 
protecting groups are used to protect three reactive nucleotide groups: the 
ribose sugar 5' hydroxyl group, the amino protecting group of the 
nucleobases adenine, guanine and cytosine (thymine does not need a 
protecting group), as well as the phosphate protecting group of the 
nucleotide 3' phosphate residue. These protecting groups are then cleaved 
off under varying conditions, either during or after the synthesis. The 4,4'- 
dimethoxytriphenylmethyl (DMT) group has become the standard for 5' 
hydroxyl, the 2-cyanoethyl group the standard for phosphate fesidue and 
various acyl groups the standard for the amino functions of the 
nucleobases according to Buchi and Khorana (J. Mol. Biol. 72 (1972), 251- 
258) and Souveaux (in: "Methods in Molecular Biology" Vol. 26, Chap. 1 
Protocol for Oligonucleoside Conjugates, Ed. S. Agrawal, Humana Press 
Inc., Totowa, N.J. 1994). The DMT group is cleaved off during synthesis 
in order to generate an hydroxyl group to which the next phosphoramidite 
can bind. The other named protecting groups remain until the end of the 
synthesis in order to prevent any side-reactions or by-products. At the end 
of the synthesis the complete oligonucleotide is completely deprotected 



largely by means of a base, whereby 2-cyanoethyl and acyl protecting 
groups are cleaved off. 

There are essentially two means of producing biochips: off-chip and on- 
chip synthesis of oligonucleotide probes. In off-chip synthesis, the 
oligonucleotide is produced on a commercially available synthesizer using 
the above-mentioned standard reagents and then immobilized on the chip. 
In on-chip synthesis, the oligonucleotide is produced directly on the chip 
using the above-mentioned standard reagents. In the former case, the 
quality of the oligonucleotide can be analysed by means of analytical 
processes such as HPLC or mass spectrometry and, where necessary, 
improved via purification. The later case of on-chip synthesis allows for 
only limited quality control and purification is not possible. Quality control 
is normally only possible by means of the covalent binding of a (mainly 
fluorescent) colouring material at the terminus of the oligonucleotide, 
which can then be detected and quantified. 

There is scarcely any mention in the literature of processes for determining 
the deprotection degree of oligonucletides. A first method comprises the 
detection of nucleobases and 5'-hydroxyl protecting groups on 
oligonucleotides with monoclonal antibodies, Fu et al.. Analytical 
Biochemistry (2002), 306(1), 135-143). Other references describe the use 
of cleavable, fluorescent protecting groups. These refer, however, to 
protecting groups for the 5'-hydroxyl group or the phosphate residue, but 
not for the nucleobase amino groups. In this context, it should be noted 
that US Patent 6,238,862 B1 describes a fluorescent, photo labile 
protecting group for the 5'-hydroxyl group used for determining the 
synthesis efficiency of the DNA synthesis array. Wagner and Pfleiderer 
(Helv. Chim. Act. 80 (1990), 200-212) describe a nucleobase protecting 
group which basically has fluorescent properties. However, this group has 
been developed for other purposes, especially for improved deprotection 
properties by using a ^-elimination reaction process. 



As a rule, exact quality controls cannot be carried out with on-chip 
synthesis. Presently, only the binding of dyes to the terminus of the 
oligonucleotide gives an indirect indication of the quantity of solid phase 
oligonucleotide and of the quality of the synthesis. This does not, however, 
provide any indication of the rate of the deprotection at the end of the 
synthesis, i.e. whether all 2-cyanoethyl and acyl protecting groups were 
cleaved off. It is known that, under standard conditions, the nucleobase 
acyl protecting groups cannot always be quantitatively cleaved off and part 
of the amino groups therefore remain protected. Complete deprotection of 
the amino groups is however essential for the optimal application of the 
chip. Should not all of the amino function protecting groups be cleaved off, 
a subsequent hybridisation can be decisively impaired, since the formation 
of Watson-Crick or Hoogsteen base pairs will be inhibited. 

Thus, the invention relates to a quality control method for manufacturing 
biopolymer arrays comprising: 

(a) synthesizing a plurality of different biopolymer species on an 
array from monomeric or oligomeric building blocks 
comprising detectable protecting groups; 

(b) optionally carrying out a determination of the detectable 
protecting groups on the array after synthesis; 

(c) cleaving off the detectable protecting groups, and 

(d) carrying out a determination of the detectable protecting 
groups on the array after cleavage; in order to determine the 
efficacy of deprotection. 

These simple steps would enable the quality control of the synthesised 
biopolymers. The main focus is quality control of the completeness of the 
biopolymer deprotection. The key advantage is that the methodology is 
non-destructive and requires no further steps once the final deprotection or 
detection has been carried out. The outcome is an evaluation of the rate of 



deprotection as well as of the quality of freely accessible biopolymers for 
later use. 

Generally, the invention relates to the use of detectable protecting groups 
in the manufacture of biopolymer arrays. The term "biopolymer" as used in 
the present application particularly relates to nucleic acids such as DNA or 
RNA or nucleic acid analogues such as peptide nucleic acids (PNA) or 
locked nucleic acids (LNA) or combinations thereof. The term, however, 
also relates to peptides and peptide analogues as well as to other 
biopolymers such as carbohydrates or any combinations thereof. Preferably 
the biopolymer species are selected from nucleic acids and nucleic acid 
analogues wherein the detectable protecting groups are coupled to 
nucleobases, particularly to amino groups of nucleobases. 

The term "array" as used in the present application relates to a solid 
support e.g. a planar or non-planar solid support. The solid support may 
have a surface selected from metals such as silicon, metal oxides such as 
silica, glass or plastic or any combination thereof. Preferably the array is a 
chip. 

The method of the present invention comprises synthesizing a plurality of 
different biopolymer species on the array from monomeric or oligomeric 
building blocks. For example, nucleic acids may be synthesized from 
phosphoamidite or phosphonate building blocks as known in the art. The 
synthesis is a spatially directed synthesis, e.g. a synthesis of different 
biopolymer species on different locations on the carrier. Methods for 
spatially directed biopolymer synthesis include, without limitation, light- 
directed synthesis, microlithography, application by inkjet, microchannel 
deposition to specific locations and sequestration with physical barriers. In 
general, these methods involve generating active sites on the carrier, 
usually by removing protecting groups and coupling to the active site a 



monomeric or oligomeric building block which, itself, optionally has a 
protected active site if further coupling of building blocks is desired. 

In a preferred embodiment, the synthesis of the biopolymer species is 
carried out in a manner that the detectable protecting groups remain on the 
biopolymer species until the synthesis has been terminated. It should be 
noted, that the synthesis can be carried out using standard protocols. 

After synthesis has been terminated a first determination of the detectable 
protecting groups on the array may be carried out. By this means, 
qualitative and/or quantitative determination of the biopolymer species on 
the array is possible. The first determination step is preferably a spatially 
resolved determination step wherein a qualitative and/or quantitative 
determination of detectable protecting groups is carried out separately on 
different locations of the array. Techniques for spatially directed detection 
procedures may comprise the use of spatially resolved detectors e.g. 
microscopes or detector matrices such as CCD imaging systems allowing 
a parallel determination on a plurality of locations on the array. 

After the optional first determination the detectable protecting groups are 
cleaved off. The cleavage may be carried out by known protocols 
according to the nature of the respective detection groups e.g. by 
photochemical methods such as irradiation, or by chemical methods such 
as acid or base treatment. In this context it should be noted that the 
detectable protection groups are preferably selected such that they are not 
cleaved off during the biopolymer synthesis procedure. 

After the cleavage of the detectable protecting groups a determination of 
the detectable protecting groups is carried out on the array in order to 
determine the efficacy of the protection. This determination may be 
qualitative and/or quantitative. If a first determination is carried out, the 
difference between the amount of protecting groups before and after 
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cleavage is an indication of the amount of deprotected biopolymer and thus 
an indication of the deprotection rate. When no more protecting groups are 
detected after deprotection the deprotection has been quantitative, 
otherwise a repetition of deprotection may be necessary. 

5 

In a first embodiment of the invention the detectable protecting groups are 
selected from fluorescent groups, e.g. groups comprising pyrene, dansyl, 
stilbene, rhodamine and/or coumarine moieties. An important feature of the 
fluorescent groups is that they are stable during oligonucleotide synthesis. 
10 For example, the fluorescent moiety may be combined with an acyl, e.g. a 
tert-butylphenoxyacetyl protecting group or another amino protecting 
group. 

In a further embodiment the detectable protecting groups may be selected 
15 from radioactively detectable protecting groups. For example, radioactively 
detectable groups are selected from groups comprising 14 C, 32 P and 3 H 
doped moieties which may be combined with a suitable protecting group, 
e.g. an amino protecting group. 

20 In a still further embodiment the detectable protecting groups may be 
selected from electrochemically detectable protecting groups, e.g. groups 
comprising ferrocene and/or phenothiazine moieties. These moieties may 
be combined with suitable protecting groups, e.g. amino protecting groups. 
The determination of such protecting groups may be carried by 

25 electrochemical methods, e.g. voltammetry. 

In a still further embodiment of the present invention the detectable 
protecting groups may be selected from UV or IR detectable protecting 
groups, e.g. from groups comprising aromatic nitro moieties, hydroxyl 
30 moieties, thiole, thioether or thiophenole moieties, nitrile moieties, 
isocyanate moieties and/or halo moieties. UV and IR detectable protecting 
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groups are preferably detectable independently from the biopolymer 
species which has been synthesised on the array. 

In a still further embodiment the detectable protecting groups are selected 
from bioaffine detectable protecting groups. Bioaffine detectable protecting 
groups comprise moieties selected from partners of a bioaffine binding pair 
which can be detected by their specific interaction with the respective 
other partner of the binding pair. Specific examples of bioaffine detectable 
protecting groups are protecting groups comprising biotin, digoxin or 
digoxigenin moieties. Biotin may be detected by its bioaffine interaction 
with streptavidin or avidin or with anti-biotin-antibodies. Digoxin or 
digoxigenin may be detected by their specific interaction with respective 
antibodies. 

In an especially preferred embodiment of the present invention the building 
blocks for the biopolymer synthesis are nucleotide building blocks having 
the general structural formulae I or II: 
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wherein R 1 is an hydroxy protecting group, 

R 2 is -H, -{CVCW-alkoxy, -(C 2 -C 10 )-alkenyloxy, -(C 2 -C 10 )-alkynyloxy, -halo, - 
azido, -NHR7, -SR7 or -OR7, wherein R7 is a protecting group or a reporter 
group, 

R 3 is a phosphate, an H-phosphate or other phosphate analogue group 
which may contain a protecting group. 



B is a nucleobase or a nucleobase analogue, 
n is 0 or 1 , and 

L is a detectable protecting group, e.g. of the structure -C(0)-R, when 
n = 1, or = CH-NR 8 R, when n = 0, wherein R is the residue of the reporter 
group and R 8 is selected from the group consisting of -(C^CaJ-alkyl. 

In the compounds (I) or (II) the 5' or 3' hydroxy protecting group R 1 is 
preferably selected from optionally substituted triphenylmethyl groups, e.g. 
4,4'-dimethoxy triphenylmethyl or 4-monomethoxy triphenyl, pixyl groups, 
photocleavable groups, e.g. p-nitrophenylpropoxy carbonyl (NPPOC) or (a- 
methyl)-6-nitropiperonyloxy carbonyl (MeNPOC), and substituted silyl 
protecting groups, e.g. tert-butyldimethyl silyl (TBDMS) or tert- 
butyldiphenyl silyl (TBDPS). 

The group R 3 is a phosphate or phosphate analogue group which may 
contain a protecting group, preferably a phosphitamide group, more 
preferably a group -P{R 6 )NR4R5, wherein R 4 and R 5 are independently 
selected from the group consisting of -H, -(C r C 10 )-alkyl, -{C 2 -C 10 )-alkenyl, - 
(C 6 -C 22 )-aryl, or wherein NR4R5 can form together with N a 5-6-membered 
heterocyclic ring, 

R 6 is selected from the group consisting of -(C 2 -C 6 )-alkenyloxy, -(C 2 -C 6 ) 
alkenyl, -(C^CgJ-alkyl, or -{(VCgJ-alkoxy, 

wherein each group may contain one or several substituents selected from 
-halo, p-nitroaryloxy and -cyano or wherein R 6 is -H. 
R 6 is most preferably a 2-cyanoethyloxy group. 

B is a nucleobase or nucleobase analogue having at least one amino group, 
e.g. adenine, guanine, cytosine, or a corresponding nucleobase analogue. 
The nucleobase or nucleobase analogue is preferably capable of hydrogen 
bridge formation with a complementary nucleobase after incorporation into 
a nucleic acid molecule. Suitable nucleobase analogues are mono- and 
bicyclic heterocycles comprising at least one amino group wherein the 
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heterocycle is different from natural nucleobases as described by Simons, 
(Advanced 2001 Chemistry Texts, Nucleoside Mimetics, Gordon and 
Breach Science Publishers, Amsterdam 2001 , Chapter 4), for example aza 
analogues of naturally occurring nucleobases, wherein a CH-moiety of a 
purine or a pyrimidine ring is replaced by nitrogen (such as 8-aza- 
adenosine) or deaza analogues of naturally occurring nucleobases wherein 
an N atom in the ring is substituted by a CH group (such as 7-deaza- 
guanosine) or combinations of aza and deaza substitutions (such as 8-aza- 
7-deaza-guanosine). Further preferred nucleobase analogues are C- 
nucleosides, e:g.* deaza nucleobases wherein the N9 atom of a purine base 
or the N1 atom of a pyrimidine base respectively is substituted by a carbon 
atom (in the sp 2 configuration) such as formycine and pseudoisocytidine. 
Further, suitable nucleobase analogues may carry additional amino groups, 
e.g. 2-aminoadenosine. Further, the nucleobase or nucleobase analogue 
may be substituted, wherein the substituted, e.g. a C,-C 3 alkyl or alkoxy, 
a C 2 -C 3 alkenyl or alkenyloxy, a C 2 -C 3 alkynyl or alkynyloxy and/or a halo 
substituent, is preferably compatible with the formation of hydrogen 
bridges to a complementary base. Preferred positions for substituents are 
C5 in pyrimidine bases, C8 in purine bases and C7 in deaza purine bases. 

Furthermore, the present invention relates to a nucleic acid synthesis 
building block having the general structural formulae I or II as described 
above. 

The building blocks are suitable for the production of nucleic acid arrays. 

Furthermore, the invention relates to a reagent kit for the synthesis of 
nucleic acid arrays comprising at least one nucleic acid synthesis building 
block as described above. For example, the reagent kit may comprise at 
least two different nucleic acid synthesis building blocks each carrying 
different detectable protecting groups. The different detectable protecting 
groups are preferably detectable independent from each other. For 



example, the reagent kit may comprise at least two different independently 
detectable fluorescent protecting groups, or combinations of different 
embodiments of detectable protecting groups, e.g. fluorescent and 
bioaffine detectable protecting groups or other combinations of different 
protecting groups. 

Further, the invention shall be explained in more detail by the following 
Figures. 

Fig. 1. Fluorescent, pyrene-labelled cytidine building block. The 

pyrene-containing nucleobase protecting group exhibits 
fluorescent properties and can therefore be detected by 
means of optical analysis procedures. 

If the nucleobase aminofunctions are protected with a 
cleavable acyl group carrying a biotin or digoxigenin labelled 
system, then detection can be carried out using streptavidin 
or anti-digoxigenin. 

Fig. 2. A cytidine building block with an IR detectable nitro aromatic 

moiety. 



Fig. 3. 



A cytidine building block with an electrochemically detectable 
ferrocene moiety. 



Claims 



A quality control method for manufacturing biopolymer arrays 
comprising 

(a) synthesizing a plurality of different biopolymer species on an 
array from monomeric or oligomeric building blocks 
comprising detectable protecting groups; 

(b) optionally carrying out a determination of the detectable 
protecting groups on the array after synthesis; 

(c) cleaving off the detectable protecting groups, and 

(d) carrying out a determination of the detectable protecting 
groups on the array after cleavage; in order to determine the 
efficacy of deprotection. 

The method of claim 1 , wherein the detectable protecting groups are 
selected from fluorescent detectable protecting groups. 

The method of claim 2, wherein the fluorescent detectable 
protecting groups are selected from groups comprising pyrene, 
dansyl, stilben, rhodamine or coumarine moieties. 

The method of claim 1 , wherein the detectable protecting groups are 
selected from radioactively detectable protecting groups. 



The method of claim 4, wherein the radioactively detectable groups 
are selected from groups comprising 14 C, 32 P or 3 H doped moieties. 



The method of claim 1 , wherein the detectable protecting groups are 
selected from electrochemically detectable protecting groups. 
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7. The method of claim 6, wherein the electrochemically detectable 
groups are selected from -groups - comprising ferrocene or 
phenothiazine moieties. 

8. The method of claim 1 , wherein the detectable protecting groups are 
selected from UV- or IR-detectable protecting groups. 

9. The method of claim 8, wherein the UV- or IR-detectable protecting 
groups are selected from groups comprising aromatic nitro moieties, 
hydroxyl moieties, thiol, thioether or thiophenol moieties, hitrile 
moieties, isocyanate or halo moieties. 

1 0. The method of claim 1 , wherein the detectable protecting groups are 
selected from bioaffine detectable protecting groups. 

11. The method of claim 1.0, wherein the bioaffine detectable protecting 
groups are selected from groups comprising biotin, digoxin or 
digoxigenin moieties. 

12. The method of any one of claims 1-11, wherein the biopolymer 
species are selected from nucleic acids, nucleic acid analogues, 
peptides and peptide analogues. 

1 3. The method of claim 1 2, wherein the biopolymer species are 
selected from nucleic acids and nucleic acid analogues and wherein 
the. detectable, protecting groups are coupled to nucleobases. 

1 4. The method of claim 1 3 wherein the detectable protecting groups 
are coupled to amino groups of nucleobases. 
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The method of any one of claims 1 2-1 4, wherein the building blocks 
for the biopolymer synthesis are nucleotide building blocks having 
the general structural formulae (I) or (II): 




wherein R 1 is an hydroxy protecting group, 

R 2 is -H, -(CVC^J-alkoxy, -(C 2 -C 10 )-aIkenyloxy, -(C 2 -C 10 )-alkynyloxy, - 
halogen, -azido, -NHR7, -SR7 or -OR7, wherein R7 is a protecting 
group or a reporter group, 

R 3 is a phosphate, an H-phosphonate or other phosphate analogue 
group which may contain a protecting group, 
B is a nucleobase or a nucleobase analogue, 
n is 0 or 1 , and 

L is a detectable protecting group. 

The method of claim 15, wherein R 1 is selected from optionally 
substituted triphenylmethyl groups, e.g. 4,4'-dimethoxy 
triphenylmethyl or 4-monomethoxy triphenyl, pixyl groups, 
photocleavable groups, e.g. p-nitrophenylpropoxy carbonyl (NPPOC) 
or (a-methyl)- 6-nitropiperonyIoxy carbonyl (MeNPOC), and 
substituted silyl protecting groups, e.g. tert-butyldimethyl silyl 
(TBDMS) or tert-butyldiphenyl silyl (TBDPS). 
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17. The method of claims 15 or 16, wherein R 3 is a phosphite amide 
- - group. - ......... . .. - 

18. The method of claim 12 wherein R 3 is the group -P(R 6 )-NR 4 R 5 
5 wherein R 4 and R 5 are independently selected from the group 

consisting of -H, -<C,-C 10 )-alkyl, -(C 2 -C 10 )-alkenyl, -(C 6 -C 22 )-aryl, or 
wherein NR4R5 can form together with N a 5-6-membered 
heterocyclic ring, and R 6 is selected from the group consisting of - 
(C 2 -C 6 )-alkenyloxy, -(C 2 -C 6 )-alkenyl, -(C,-Ce)-alkyl, -(d-C^-alkoxy, 
io wherein each group may contain one or several substituents 

selected from -halo, p-nitroaryloxy and -cyano or wherein R 6 is -H. 

19. The method of claim 18, wherein R 6 is a 2-cyanoethyIoxy group. 

is 20. The method of any one of claims 1 5-1 9 wherein L has the structure 
-C(0)-R, when n = 1, or =CH-NR 8 R when n = 0, wherein R is the 
residue of the protecting group and R 8 is selected from H and -(C,- 
C 3 )-alkyl. 

20 21 . The method of any one of claims 1 5-20, wherein B is selected from 
adenine, guanine, cytosine, aza and/or deaza analogues thereof, and 
analogues containing additional amino groups. 

22. A nucleic acid synthesis building block having the general structural 
25 formulae (I) or (II): 
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wherein R 1 is an hydroxy protecting group, 

R 2 is -H, -(CVC.oJ-alkoxy, -(C 2 -C 10 )-alkenyloxy, -(C 2 -C 10 )-alkynyloxy, - 
halogen, -azido, -NHR7, -SR7 or -OR7, wherein R7 is a protecting 
group or a reporter group, 
5 R 3 is a phosphate, an H-phosphonate or other phosphate analogue 

group which may contain a protecting group, 
B is a nucleobase or a nucleobase analogue, 
n is 0 or 1 , and 

L is a detectable protecting group. 

23. Use of a building block of claim 22 for the production of nucleic acid 
arrays. 

24. A reagent kit for the synthesis of nucleic acid arrays comprising at 
15 least one nucleic acid synthesis building block of claim 22. 

25. The reagent kit of claim 24 comprising at least 2 nucleic acid 
synthesis building blocks of claim 22 each carrying different 
detectable protecting groups. 
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Figure 3 




